Field of the technology 

This invention pertains to a novel PGTH protein of human brain origin having a 
prostaglandin transport activity and the pgth gene encoding the protein. 

Prior art 

Prostaglandin is a generic name for a series of physiologically active lipids such as 
prostaglandin E, prostaglandin D, prostaglandin F, prostaglandin I, prostaglandin J, etc. 
Acting through specific cell membrane or intranuclear receptors, prostaglandins are physiologically 
active substances strongly involved in the control of physiological functions such as blood flow 
rate, sleep, gastric mucosa protection, thrombus formation, pregnancy, etc. 

Prostaglandin is produced inside cells in response to various physiological stimuli: 
eicosapolyenoic acids such as arachidonic acid are cleaved from the cell membrane by 
phospholipase A2 and converted by cyclooxygenase and various prostaglandin synthases. After 
being released outside the cells, prostaglandin has autocrine or paracrine effects. Liberated 
prostaglandin is also carried in the bloodstream, taken up by specific cells, metabolized, and 
thereby eliminated. 

A trace amount of prostaglandin shows a strong physiological activity, and consequently, 
the production of prostaglandin compounds is strictly controlled by controlling the activity of 
production-related and metabolism-related enzymes. 

However, prostaglandin has been reported to be unable to pass through the lipid bilayer 
of the cell membrane by itself. Therefore, a special protein has been presumed to act as a 
prostaglandin transport mechanism, both when prostaglandin produced inside a cell exits it and 
when prostaglandin circulating in the bloodstream is taken up into a specific cell. 

As a protein involved in the transport mechanism described above, a prostaglandin transporter 
(human prostaglandin transporter, abbreviated hPGT below) has been reported, but it is not 
involved in the transport of all prostaglandin compounds, and many points remain unclear. 
Consequently, it is thought that if a biological molecule other than hPGT involved in the 
transport mechanism can be identified, that molecule might be usable directly as a medical 
drug or indirectly as a tool for screening compounds that might be usable as medical drugs. 
Therefore, the objective of this invention is to identify such a molecule and use it as a 
medical drug or for the development of medical drugs. 
Presentation of the invention 

The inventors of this invention searched diligently for the desired protein among the 
genes expressed in the human brain. As a result, they found a novel PGTH protein 
(prostaglandin transporter homologue), successfully isolated the pgth gene encoding the 
protein, and thus arrived at this invention. 

Specifically, this invention pertains to (a) a protein having the amino acid sequence 
described in sequence No. 1 or (b) a protein having an amino acid sequence with 1 to several 
amino acids deleted, substituted or added to the amino acid sequence of sequence No. 1, and 
having a prostaglandin transport activity. 

Furthermore, this invention also pertains to (c) a gene comprising the DNA described in 
sequence No. 2 or (d) DNA which can be hybridized with the DNA of sequence No. 2 under 
stringent conditions and which encodes a protein having a prostaglandin transport activity. 

The pgth gene of this invention can be isolated as a cDNA fragment containing the gene 
from a cDNA library of human brain origin. The cDNA library used by the inventors of this 
invention was prepared from commercially available mRNA of human brain origin from the 
Clontech Co. 

To identify the cDNA encoding a protein having a prostaglandin transport activity in the 
cDNA library described above, the method of Ohara, et al. (DNA Research 4: p 53, 1997), an 
extensive cDNA library analysis method using a long-chain cDNA library, was used. From a 
long-chain cDNA library of human brain origin prepared by the method of Ohara, et al., 25,000 
recombinants were randomly selected, and the 5'- and 3'-terminal base sequences of the cDNAs 
from 15,000 clones were determined. A clone showing homology to the gene encoding the already 
reported hPGT can then be found by searching the 5' sequences of all the clones with DNA 
analysis programs (BLAST and FastA). 

The presence of a region encoding the protein (ORF: open reading frame) in the base 
sequence can be confirmed by a conventional method using a computer program. After 
confirming the presence of the desired gene in the cDNA sequence, the inventors of this 
invention found one ORF in the sequence by computer analysis; the gene was named pgth, 
and the protein encoded by the gene was named PGTH. The PGTH of the invention is a protein 
comprising a total of 709 amino acid residues and having a molecular weight of about 80 kDa. 
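
The ORF check described above can be illustrated with a short script. This is only a minimal sketch, not the program the inventors used: the function names `longest_orf` and `approx_mass_kda` are hypothetical, only the three forward reading frames are scanned, the input is a toy sequence, and the figure of 110 Da per residue is a common rule-of-thumb average rather than a value taken from this specification.

```python
# Minimal sketch: scan the three forward reading frames of a cDNA for the
# longest ATG...stop open reading frame. All names here are hypothetical.
STOP_CODONS = {"taa", "tag", "tga"}


def longest_orf(dna):
    """Return (start, end) of the longest ORF; end is exclusive and includes the stop codon."""
    dna = dna.lower()
    best = (0, 0)
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "atg":
                start = i  # the first ATG opens a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if i + 3 - start > best[1] - best[0]:
                    best = (start, i + 3)
                start = None  # keep looking for later ORFs in this frame
    return best


def approx_mass_kda(n_residues, avg_da=110.0):
    """Rough protein mass from the residue count (assumed average of 110 Da per residue)."""
    return n_residues * avg_da / 1000.0


# Toy input, not a sequence from this specification.
toy = "ccatggcctgaaatgaaacccgggtaacc"
start, end = longest_orf(toy)
print(toy[start:end])
print(round(approx_mass_kda(709), 1))
```

Under the same rule of thumb, 709 residues correspond to roughly 78 kDa, which is in line with the stated value of about 80 kDa. Likewise, 709 codons plus one termination codon give 710 x 3 = 2130 bases.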

The pgth of this invention is a gene comprising the 2130 bp shown in sequence No. 2. By using 
this pgth and conventional genetic recombination techniques with a suitable host-vector system, 
it is possible to prepare a recombinant gene. Suitable vectors include plasmids of E. coli origin 
(such as pBR322, pUC118, etc.), plasmids of Bacillus subtilis origin (such as pUB110, pC194, 
etc.), plasmids of yeast origin (such as pSH19, etc.), bacteriophages, and animal viruses such as 
retroviruses, vaccinia virus, etc. At the time of recombination, it is possible to add translation 
initiation and termination codons using suitable DNA adaptors. Furthermore, for gene expression, 
a suitable expression promoter is attached upstream of the gene. The promoter to be used is 
suitably selected depending on the host used. For example, if the host is E. coli, there are T7, 
lac, trp, λPL promoters, etc.; if the host is a Bacillus, there are SPO promoters, etc.; if the 
host is a yeast, there are PHO5, GAP, ADH promoters, etc.; and if the host is an animal cell, 
there are SV40-origin, retrovirus promoters, etc. 

Furthermore, the gene may be expressed as a fusion protein with another protein (such as 
glutathione-S-transferase, protein A, etc.). When a fused PGTH is prepared by such a 
method, a suitable protease (such as thrombin, etc.) may be used to cleave off the PGTH. 

As a host usable in the case of PGTH expression, there are various strains of Escherichia 
coli, various strains of Bacillus subtilis, various strains of the yeast Saccharomyces cerevisiae 
and animal cells such as COS-7, CHO cells, etc. 

As a method for transforming a host cell using the above recombinant vector, a specific 
method conventionally used to transform the selected host cell is used. 

Incidentally, in this invention, DNA which has a DNA sequence other than that shown in 
sequence No. 2 which can be hybridized with the DNA and encodes a protein having a 
prostaglandin transport activity, is also included in the scope of this invention. 

Specifically, DNA in which the full-length pgth sequence has been partially changed by 
various artificial treatments, such as random mutagenesis, introduction of site-specific 
mutations, mutagen treatment, or mutation, deletion or ligation of DNA fragments after cleavage 
with restriction enzymes, is also included in the scope of this invention even though its DNA 
sequence differs from that of sequence No. 2, as long as the DNA variant can be hybridized with 
pgth under stringent conditions and encodes a protein having a prostaglandin transport 
activity. 

The extent of the above DNA mutation is within the allowable range if the variant has 
90% or higher homology with the DNA sequence of pgth. Furthermore, as a measure of the extent 
of hybridization with pgth, Southern hybridization with pgth may be carried out under 
conventional conditions: for example, when the probe is labeled with a DIG DNA Labeling kit 
(Boehringer-Mannheim Cat. No. 1175033), hybridization in a DIG Easy Hyb solution 
(Boehringer-Mannheim Cat. No. 1603558) at 32°C and washing of the membrane in a 5X SSC 
solution (containing 0.1% w/v SDS) at 50°C (1X SSC comprises 0.15 M NaCl and 
0.015 M sodium citrate). 
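
As a worked example of the wash buffer arithmetic, the salt concentrations of an n-fold SSC solution follow directly from the 1X composition given in the text. This is a minimal sketch; the function name `ssc_molarity` is hypothetical and the rounding to six decimals is only there to keep the printed values tidy.

```python
# Minimal sketch: salt concentrations of an n-fold SSC solution, scaled from
# the 1X SSC composition stated in the text. The function name is hypothetical.
SSC_1X = {"NaCl": 0.15, "sodium citrate": 0.015}  # mol/L at 1X, from the text


def ssc_molarity(fold):
    """Return the molar concentration of each salt in a `fold`-X SSC solution."""
    return {salt: round(conc * fold, 6) for salt, conc in SSC_1X.items()}


print(ssc_molarity(5))
```

For the 5X SSC wash described above, this gives 0.75 M NaCl and 0.075 M sodium citrate.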
Furthermore, a protein encoded by the gene variant which is highly homologous to pgth 
as described above and has a prostaglandin transport activity is also included in the scope of this 
invention. 

Specifically, a variant having one or more amino acids deleted, substituted or added to 
the amino acid sequence of PGTH is included in the scope of this invention as long as this 
variant is a protein having a prostaglandin transport activity. 

The side chains of the amino acids that constitute proteins differ from one another in 
hydrophobicity, electrical charge, size, etc., but several highly conservative relationships, 
in the sense that they practically do not affect the three-dimensional structure (also called 
the steric structure) of the protein as a whole, are known from experience and from actual 
physicochemical measurements. For example, as substitutions of amino acid residues, there are 
glycine (Gly) and proline (Pro), Gly and alanine (Ala) or valine (Val), leucine (Leu) and 
isoleucine (Ile), glutamic acid (Glu) and glutamine (Gln), aspartic acid (Asp) and asparagine 
(Asn), cysteine (Cys) and threonine (Thr), Thr and serine (Ser) or Ala, lysine (Lys) and 
arginine (Arg), etc. 

Therefore, any variant protein due to substitution, insertion, deletion, etc., in the amino 
acid sequence of the PGTH shown in sequence No. 1 can be said to be within the scope of this 
invention if the variation is a variation which conserves the three-dimensional structure of the 
PGTH, and the protein is a protein having a prostaglandin transport activity similar to PGTH. 
The allowable extent of this variation is 90% or higher homology with the amino acid sequence 
shown in sequence No. 1. 
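
The 90% criterion can be illustrated with a simple identity calculation. This is a sketch under stated assumptions: the specification does not say how homology is computed, so plain percent identity over two pre-aligned, equal-length sequences is assumed here, and the function names `percent_identity` and `within_allowed_range` are hypothetical. The reference string is the first 20 residues of sequence No. 1 in one-letter code; the variant is an invented example.

```python
# Minimal sketch: percent identity between two pre-aligned sequences, used as
# a stand-in for the "homology" figure. Names and the metric are assumptions.
def percent_identity(a, b):
    """Percent identity over two aligned, equal-length sequences ('-' marks a gap)."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


def within_allowed_range(a, b, threshold=90.0):
    """True if the pair meets the 90% threshold stated in the text."""
    return percent_identity(a, b) >= threshold


reference = "MGPRIGPAGEVPQVPDKETK"  # residues 1-20 of sequence No. 1
variant = "MGPRLGPAGEVPQVPDKETK"    # hypothetical variant with one Ile -> Leu substitution
print(percent_identity(reference, variant))
print(within_allowed_range(reference, variant))
```

A real comparison would first align the full-length sequences with a dedicated alignment tool; the function above only scores an alignment that already exists.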

Industrial application Field 

Because PGTH has a prostaglandin transport activity, abnormal expression of pgth or 
functional failure of PGTH is presumed to cause critical disorders, since the normal 
prostaglandin production mechanism of the body is lost as a consequence. 

Therefore, PGTH itself is considered to be useful as a drug, and on the other hand, pgth 
or PGTH may be used for effectively studying or evaluating a substance having the same 
function as that of PGTH, a substance promoting or inhibiting its function, a substance 
promoting the expression of the gene, etc. 

Best embodiment of the present invention 

This invention is explained further in detail using application examples as follows, but 
this invention is certainly not limited to these application examples. Incidentally, unless 
specified, the experimental procedures used in the following application examples are those 
described in standard experimental manuals such as Molecular Cloning, 2nd ed. (Cold Spring 
Harbor Laboratory Press, 1989), etc., and the operating manuals in commercially available kits, 
and they can be carried out under the conditions recommended for the respective commercially 
available products such as restriction enzymes, etc. 

Application Example 1 Cloning of pgth 

1) Construction of a long chain cDNA library of human brain origin 

An oligonucleotide (GACTAGTTCTAGATCGCGAGCGGCCGCCC(T)15) containing a 
NotI site was synthesized using a DNA synthesizer (ABI380B). It was used as a primer, and 
double-stranded cDNA was synthesized using mRNA of human brain origin as a template and the 
Superscript II reverse transcriptase kit (Gibco BRL). The cDNA was ligated to a SalI 
site-containing adapter (Takara Shuzo); subsequently, NotI digestion was carried out, and cDNA 
fragments of 3 kb or larger were purified by electrophoresis in 1% low-melting agarose. 

After ligation of the purified cDNA fragments with a SalI-NotI restriction enzyme-treated 
pBluescript II SK+ plasmid, the recombinant plasmids were introduced into E. coli ElectroMax 
DH10B strain (Gibco BRL) by electroporation. Subsequently, 25,000 recombinants were randomly 
selected from the library, the recombinant DNAs were extracted, and the 5'- and 3'-terminal 
base sequences of the cDNAs of 15,000 clones were determined. For the sequence determination, 
a PE Applied Biosystems DNA sequencer (ABI PRISM377) and the reaction kit from the same 
company were used. 

2) Selection of clones containing the pgth sequence 

The 5' sequences of all the clones determined in 1) were compared with the sequence of 
hPGT already reported using DNA analytical programs (BLAST and FastA), and as a result, a 
clone named HK07457 showed significant homology. 

3) DNA fragment base sequence determination 

The base sequence determination was carried out using a PE Applied Biosystems 
DNA sequencer and the dye primer method. Most of the sequence was determined by the 
shotgun method; for the remaining portion, oligonucleotides were synthesized based on the 
base sequence already determined, and the primer walking method was used, so that the entire 
base sequence of both strands was determined. The entire base sequence of the cDNA of the 
clone is shown in sequence No. 3. 

The cDNA contains an ORF encoding a protein (PGTH) comprising 709 residues. A 
termination codon was found in the same reading frame upstream of the methionine codon that 
serves as the initiation codon of the protein. Therefore, the amino acid sequence shown in 
sequence No. 3 was confirmed to be the only possible amino acid sequence of the protein 
encoded by the cDNA fragment. 
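
The upstream termination codon check can be sketched as follows. This is an illustration under stated assumptions, not the inventors' procedure: the function name `upstream_inframe_stop` is hypothetical, and the 66-nucleotide string is the stretch immediately upstream of the initiation codon as read from sequence No. 3 (nucleotides 201 to 266), followed by the first three codons of the ORF.

```python
# Minimal sketch: walk upstream from the initiation codon in steps of three
# and report the first in-frame termination codon. The name is hypothetical.
STOP_CODONS = {"taa", "tag", "tga"}


def upstream_inframe_stop(dna, atg_index):
    """Return the 0-based index of the nearest in-frame stop codon upstream of the ATG, or None."""
    if dna[atg_index:atg_index + 3].lower() != "atg":
        raise ValueError("atg_index does not point at an ATG")
    for i in range(atg_index - 3, -1, -3):
        if dna[i:i + 3].lower() in STOP_CODONS:
            return i
    return None


# Assumption: nucleotides 201-266 of sequence No. 3 as transcribed here.
upstream = ("ctcccctgct" "aagctccagg" "tcctgagatt" "gaattagggg"
            "ctggagctca" "ctgcactcca" "gcagtc")
seq = upstream + "atgggaccc"
hit = upstream_inframe_stop(seq, len(upstream))
print(hit, seq[hit:hit + 3])
```

With this input the nearest in-frame stop is a taa at offset 9 of the excerpt, 57 nucleotides (19 codons) upstream of the ATG, so translation in this frame cannot start further upstream without hitting a stop first.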

Figure 1 shows the amino acid homology between already reported hPGT and the PGTH 
of this invention. The two show high homology; in particular, the position of the cysteine residue 
present at the C-terminal of PGTH is preserved, and the 77th residue glutamine, 561st residue 
arginine and 614th residue lysine of hPGT, which are amino acids especially important for the 
transport activity, are also preserved in PGTH. 

Application Example 2 

Confirmation of protein expression by in vitro translation of pgth 

The plasmid containing pgth prepared in Application Example 1 was treated with RNase 
A; subsequently, the RNase A was removed using ADVAMAX beads (AGTC Co.), and in vitro 
translation was carried out using a TNT T7 coupled reticulocyte lysate system (Promega Co.) in 
the presence of (35S)-methionine. A portion of the reaction mixture was separated by 
SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and analyzed using a BAS-2000 
(Fuji Shashin Kogyo). As a result, a single band at about 80 kDa was confirmed, as shown in 
Figure 2. 

Application Example 3 

Construction of animal cell expression vector 

1) Amplification of ORF-containing cDNA 

An oligonucleotide (sequence 1 below) having a sequence upstream from the 
initiation codon of the protein of sequence No. 3 and an oligonucleotide (sequence 2 below) 
having the sequence of the reverse complementary strand of a portion downstream from the 
termination codon of the protein were synthesized using a DNA synthesizer (ABI Co., Model 
380B). 

Sequence 1 

5'-CTGGAGCTCACTGCACTCCAGCAGTC-3' 

Sequence 2 

5'-AGCTCACACTCGGGAATCCTCTGGCTTC-3' 

The recombinant cDNA containing sequence No. 3 isolated in Application Example 1 was 
used as a template, the oligonucleotides of sequence 1 and sequence 2 were used as 
primers, and the following PCR procedures were carried out using a Takara LA PCR kit Ver. 2 
and the PCR thermal cycler MP (Takara Shuzo). 

cDNA                                      5 µL (10 ng) 
10X PCR buffer (containing 25 mM Mg++)    5 µL 
2.5 mM dNTP                               8 µL 
10 µM Sequence 1                          2 µL 
10 µM Sequence 2                          2 µL 
Water                                     27.5 µL 
LA Taq polymerase                         0.5 µL 
Total amount                              50 µL 

The PCR cycle was carried out by holding at 94°C for 2 min, then carrying out the reaction at 
98°C for 20 sec, cooling to 68°C at a rate of 1°C/2 sec, holding at 68°C for 3 min and at 72°C 
for 10 min, and repeating 30 times. 

The above method was used to amplify a DNA fragment (about 2.2 kb) having a portion 
of sequence No. 3. 
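
The composition of the 50 µL reaction can be cross-checked with a few lines of arithmetic. This is a minimal sketch: the helper names `total_volume` and `final_concentration` are hypothetical, and the volume of the Sequence 2 primer (2 µL) is an assumption inferred from the stated total, because that row is illegible in the source text.

```python
# Minimal sketch: check that the listed PCR components add up to the stated
# total and compute final concentrations. Helper names are hypothetical.
reaction_ul = {
    "cDNA (10 ng)": 5.0,
    "10X PCR buffer": 5.0,
    "2.5 mM dNTP": 8.0,
    "10 uM Sequence 1": 2.0,
    "10 uM Sequence 2": 2.0,  # assumption: inferred from the 50 uL total
    "water": 27.5,
    "LA Taq polymerase": 0.5,
}


def total_volume(mix):
    """Sum of all component volumes in microlitres."""
    return sum(mix.values())


def final_concentration(stock, volume_ul, total_ul):
    """Final concentration after dilution, in the same unit as `stock`."""
    return stock * volume_ul / total_ul


total = total_volume(reaction_ul)
print(total)
print(final_concentration(10.0, reaction_ul["10 uM Sequence 1"], total))  # primer, uM
print(final_concentration(2.5, reaction_ul["2.5 mM dNTP"], total))        # dNTP, mM
```

Under these assumptions the components sum to 50 µL, each primer ends up at 0.4 µM, the dNTP mix at 0.4 mM, and the 10X buffer at 1X.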

2) Subcloning to an animal cell expression vector 

The DNA fragment amplified in 1) was fractionated by 1% agarose gel electrophoresis. 
After staining the gel with ethidium bromide, the portion of the gel containing the desired band 
observed under ultraviolet irradiation was cut out. The DNA fragment was extracted from the 
agarose gel and purified using a GENECLEAN II Kit (Bio101 Co.). 

The extracted and purified DNA fragment was subcloned into the animal cell expression vector 
pTARGET (Promega Co.). A Takara Ligation Kit Ver. 2 (Takara Shuzo) was used for the 
ligation, and the reaction was carried out with the following composition at 16°C for 1.5 h. 

Extracted and purified DNA fragment    1 µL (50 ng) 
pTARGET                                1 µL (10 ng) 
Water                                  3 µL 
Ligation solution                      5 µL 
Total                                  10 µL 

The reaction solution after the above reaction was used to transform the E. coli K12 strain 
DH5. The transformant was inoculated on an LB agar medium containing 50 µg/mL of 
ampicillin (Amp), 40 µg/mL of 5-bromo-4-chloro-3-indolyl-β-D-galactoside (IPTG) [sic; 
isopropyl-β-D-thiogalactopyranoside] and 100 µM of isopropyl β-D-thiogalactopyranoside 
(X-gal) [sic; 5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside]* and cultivated overnight at 
37°C. 

Each colony that developed on the above plate was inoculated in 10 mL of an LB liquid 
medium containing 50 µg/mL of Amp, cultivation was carried out overnight at 37°C, the 
cells were collected by centrifugation, and subsequently the recombinant DNA was purified 
using a QIAprep Spin Plasmid Miniprep Kit (Qiagen Co.) to obtain pTARGETpgth. 

3) Determination of the base sequence of the inserted cDNA 

The base sequence determination was carried out using a DNA sequencer (ABI Co., 
Model PRISM377) and the dye terminator method, and the whole base sequence of both 
strands was determined using the primer walking method. The clone was found to contain all of 
the region of sequence No. 3 between sequences 1 and 2, confirming that the desired 
pTARGETpgth had been cloned. 

Application Example 4 

Insertion into CHO-K1 cells and stable transformant preparation 

The recombinant DNA, pTARGETpgth, prepared in Application Example 3 has a CMV 
promoter upstream of pgth, and if it is inserted into an animal cell, the expression of pgth is 
possible. 

CHO-K1 cells were cultured in 60 mm diameter plastic Petri dishes. As the culture 
medium, Ham F-12 (Gibco, called growth medium below) containing 10% fetal bovine serum 
(Dainippon Seiyaku), 50 U/mL of penicillin and 50 µg/mL of streptomycin was used, and culture 
was carried out at 37°C in the presence of 5% CO2. When the cell density reached 50%, 
LIPOFECTAMINE reagent (Gibco) containing the pTARGETpgth prepared in Application Example 
3 was layered over the cells and incubated for 6 h, and, after replacement with the growth 
medium, culture was continued for 48 h. After dispersing the cells with trypsin, the cell 
suspension was placed in a 60 mm diameter plastic Petri dish, and culture was carried out for 24 
h. The culture medium was then removed and replaced by growth medium containing G418 
(Gibco, final concentration of 500 µg/mL). The G418 medium was changed every 3 days, and 
culture was continued for 2 weeks. When the cell colonies were observable with the naked eye, 3 
colonies were isolated using stainless steel cups. As a control, the pTARGET vector alone 
(Promega Co.) was inserted into CHO-K1 cells by the same procedures as those 
described above to isolate a stable transformant. 



* [Editor's note: The compound names and abbreviations are so garbled in the original text that it is impossible to be 
certain whether it should be 40 µg/mL IPTG and 100 µM X-gal, or vice-versa.] 

washed with a suitable buffer solution containing bovine serum albumin, and culture was 
continued for 20 min using a buffer solution containing (3H)-labeled PGE2 (Amersham Co.). 
After washing the cells, they were recovered, and the radioactivity taken up was measured. As a 
result, the prostaglandin transport activity of the CHO-K1 cells with pgth inserted was statistically 
significantly higher than that of the CHO-K1 cells with only the control vector inserted. 

Application Example 6 

Expression of pgth mRNA in human macrophages loaded with oxidized LDL 

1) Preparation of human macrophages loaded with oxidized LDL and normal monocyte cDNA 

Normal monocyte cDNA was prepared using RNA prepared with Trizol (Gibco BRL 
Co.) from CD14-positive monocytes from human peripheral blood as a template and the 
Superscript II reverse transcriptase kit (Gibco BRL). Human macrophages loaded with oxidized 
LDL were prepared by culturing normal monocytes in RPMI-1640 medium (Dainippon 
Seiyaku) containing 20% AB serum and antibiotics for 14 days, adding human LDL oxidized 
with copper sulfate by conventional procedures (oxidized LDL) to a final concentration of 
40 µ/mL [sic; dimension incorrect], and continuing culture for 24 h. cDNA was prepared by a 
method similar to that used for normal monocytes. 

2) Confirmation of pgth mRNA expression by the RT-PCR method 

Oligonucleotides (following sequence 3) having a sequence contained in sequence No. 2 and 
oligonucleotides (following sequence 4) having the sequence of the reverse complementary 
strand were respectively synthesized using a DNA synthesizer (ABI Co., Model 380B). 

Sequence 3 

5'-GCTCCTGCCCATTGGACGGCTTTAACC-3' 

Sequence 4 

5'-TCACACTCGGGAATCCTCTGGCTTC-3' 
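
The relationship between the reverse primer and the coding strand can be sketched as follows. This is an illustration only: the helper names `reverse_complement` and `translate` are hypothetical, the codon table is the standard genetic code, and the one-base frame offset used for translation is an assumption chosen so that the result lines up with the C-terminal residues of sequence No. 1.

```python
# Minimal sketch: reverse-complement sequence 4 and translate the sense strand.
# Helper names are hypothetical; the codon table is the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (upper case)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons of `seq` into one-letter amino acids ('*' marks a stop)."""
    seq = seq.upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


sequence_4 = "TCACACTCGGGAATCCTCTGGCTTC"
sense = reverse_complement(sequence_4)
print(sense)
print(translate(sense[1:]))  # assumption: the reading frame starts at the second base
```

Read in that frame, the sense strand encodes Lys-Pro-Glu-Asp-Ser-Arg-Val followed by a termination codon, which corresponds to the last seven residues of sequence No. 1.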

The cDNA prepared in (1) was used as a template, the oligonucleotides with sequences 3 and 4 
were used as primers, and the following PCR procedures were carried out using a Takara LA 
PCR kit Ver. 2 and the PCR thermal cycler MP (Takara Shuzo). 

cDNA                                      2 µL (40 ng) 
10X PCR buffer (containing 25 mM Mg++)    1.5 µL 
2.5 mM dNTP                               2.4 µL 
10 µM Sequence 3                          0.4 µL 
10 µM Sequence 4                          0.4 µL 
Water                                     10.15 µL 
LA Taq polymerase                         0.15 µL 
Total amount                              15 µL 



The PCR cycle was carried out by holding at 94°C for 5 min, then carrying out the reaction at 
94°C for 1 min, holding at 58°C for 1 min and then at 72°C for 1 min, and repeating 30 
times. The PCR reaction mixture was fractionated by 1% agarose gel electrophoresis. After 
staining the gel with ethidium bromide, an amplified band at about 500 bp was detected under 
ultraviolet irradiation. Similarly, primers for amplifying the glyceraldehyde 3-phosphate 
dehydrogenase gene (G3PDH, Clontech Co.) were used as a control for the cDNA in the PCR 
test. As a result, the expression of pgth mRNA was strongly induced in the macrophages loaded 
with oxidized LDL, as shown in Figure 3. 

Normal monocytes, macrophages loaded with oxidized LDL or equivalent cultured cells 
may be cultured with a test compound added, and subsequently the change in the PGTH mRNA 
may be measured by the method described above to screen any substance controlling PGTH 
mRNA expression. 

Brief description of the figures 

Figure 1 shows comparison of amino acid sequence homology between hPGT and the 
PGTH of this invention. 

Figure 2 shows the results of SDS-PAGE of PGTH expressed using the in vitro 
translation method using pgth. 

Figure 3 shows the results of detecting pgth mRNA expression in human macrophages 
loaded with oxidized LDL by the RT-PCR method. In the figure, o shows the 
results for human macrophages loaded with oxidized LDL, and m shows the results for normal 
human monocytes. 

Claims 

(1) A protein of the following (a) or (b). 

(a) Protein comprising the amino acid sequence of sequence No. 1 

(b) Protein comprising an amino acid sequence with one or more amino acids deleted, 
substituted or added to the amino acid sequence of sequence No. 1, and, at the same time, having 
a prostaglandin transport activity. 



(2) DNA of the following (a) or (b). 

(a) DNA comprising the base sequence of sequence No. 2 

(b) DNA which can be hybridized with the DNA of sequence No. 2 under stringent 
conditions and at the same time, encodes a protein having a prostaglandin transport activity 

SEQUENCE LISTING 

<110> TAISHO PHARMACEUTICAL CO., Ltd. 
<120> Prostaglandin 
<130> P487 
<150> JP 10-227723 
<151> 1998-08-12 
<160> 3 

<210> 1 
<211> 709 
<212> PRT 
<213> Homo sapiens 
<400> 1 



Met Gly Pro Arg Ile Gly Pro Ala Gly Glu Val Pro Gln Val Pro
                5                   10                  15

Asp Lys Glu Thr Lys Ala Thr Met Gly Thr Glu Asn Thr Pro Gly
                20                  25                  30

Gly Lys Ala Ser Pro Asp Pro Gln Asp Val Arg Pro Ser Val Phe
                35                  40                  45

His Asn Ile Lys Leu Phe Val Leu Cys His Ser Leu Leu Gln Leu
                50                  55                  60

Ala Gln Leu Met Ile Ser Gly Tyr Leu Lys Ser Ser Ile Ser Thr
                65                  70                  75

Val Glu Lys Arg Phe Gly Leu Ser Ser Gln Thr Ser Gly Leu Leu
                80                  85                  90

Ala Ser Phe Asn Glu Val Gly Asn Thr Ala Leu Ile Val Phe Val
                95                  100                 105

Ser Tyr Phe Gly Ser Arg Val His Arg Pro Arg Met Ile Gly Tyr
                110                 115                 120

Gly Ala Ile Leu Val Ala Leu Ala Gly Leu Leu Met Thr Leu Pro
                125                 130                 135

His Phe Ile Ser Glu Pro Tyr Arg Tyr Asp Asn Thr Ser Pro Glu
                140                 145                 150

Asp Met Pro Gln Asp Phe Lys Ala Ser Leu Cys Leu Pro Thr Thr
                155                 160                 165

Ser Ala Pro Ala Ser Ala Pro Ser Asn Gly Asn Cys Ser Ser Tyr
                170                 175                 180

Thr Glu Thr Gln His Leu Ser Val Val Gly Ile Met Phe Val Ala
                185                 190                 195

Gln Thr Leu Leu Gly Val Gly Gly Val Pro Ile Gln Pro Phe Gly
                200                 205                 210

Ile Ser Tyr Ile Asp Asp Phe Ala His Asn Ser Asn Ser Pro Leu
                215                 220                 225

Tyr Leu Gly Ile Leu Phe Ala Val Thr Met Met Gly Pro Gly Leu
                230                 235                 240

Ala Phe Gly Leu Gly Ser Leu Met Leu Arg Leu Tyr Val Asp Ile
                245                 250                 255

Asn Gln Met Pro Glu Gly Gly Ile Ser Leu Thr Ile Lys Asp Pro
                260                 265                 270

Arg Trp Val Gly Ala Trp Trp Leu Gly Phe Leu Ile Ala Ala Gly
                275                 280                 285

Ala Val Ala Leu Ala Ala Ile Pro Tyr Phe Phe Phe Pro Lys Glu
                290                 295                 300

Met Pro Lys Glu Lys Arg Glu Leu Gln Phe Arg Arg Lys Val Leu
                305                 310                 315

Ala Val Thr Asp Ser Pro Ala Arg Lys Gly Lys Asp Ser Pro Ser
                320                 325                 330

Lys Gln Ser Pro Gly Glu Ser Thr Lys Lys Gln Asp Gly Leu Val
                335                 340                 345

Gln Ile Ala Pro Asn Leu Thr Val Ile Gln Phe Ile Lys Val Phe
                350                 355                 360

Pro Arg Val Leu Leu Gln Thr Leu Arg His Pro Ile Phe Leu Leu
                365                 370                 375

Val Val Leu Ser Gln Val Cys Leu Ser Ser Met Ala Ala Gly Met
                380                 385                 390

Ala Thr Phe Leu Pro Lys Phe Leu Glu Arg Gln Phe Ser Ile Thr
                395                 400                 405

Ala Ser Tyr Ala Asn Leu Leu Ile Gly Cys Leu Ser Phe Pro Ser
                410                 415                 420

Val Ile Val Gly Ile Val Val Gly Gly Val Leu Val Lys Arg Leu
                425                 430                 435

His Leu Gly Pro Val Gly Cys Gly Ala Leu Cys Leu Leu Gly Met
                440                 445                 450

Leu Leu Cys Leu Phe Phe Ser Leu Pro Leu Phe Phe Ile Gly Cys
                455                 460                 465

Ser Ser His Gln Ile Ala Gly Ile Thr His Gln Thr Ser Ala His
                470                 475                 480

Pro Gly Leu Glu Leu Ser Pro Ser Cys Met Glu Ala Cys Ser Cys
                485                 490                 495

Pro Leu Asp Gly Phe Asn Pro Val Cys Asp Pro Ser Thr Arg Val
                500                 505                 510

Glu Tyr Ile Thr Pro Cys His Ala Gly Cys Ser Ser Trp Val Val
                515                 520                 525

Gln Asp Ala Leu Asp Asn Ser Gln Val Phe Tyr Thr Asn Cys Ser
                530                 535                 540

Cys Val Val Glu Gly Asn Pro Val Leu Ala Gly Ser Cys Asp Ser
                545                 550                 555

Thr Cys Ser His Leu Val Val Pro Phe Leu Leu Leu Val Ser Leu
                560                 565                 570

Gly Ser Ala Leu Ala Cys Leu Thr His Thr Pro Ser Phe Met Leu
                575                 580                 585

Ile Leu Arg Gly Val Lys Lys Glu Asp Lys Thr Leu Ala Val Gly
                590                 595                 600

Ile Gln Phe Met Phe Leu Arg Ile Leu Ala Trp Met Pro Ser Pro
                605                 610                 615

Val Ile His Gly Ser Ala Ile Asp Thr Thr Cys Val His Trp Ala
                620                 625                 630

Leu Ser Cys Gly Arg Arg Ala Val Cys Arg Tyr Tyr Asn Asn Asp
                635                 640                 645

Leu Leu Arg Asn Arg Phe Ile Gly Leu Gln Phe Phe Phe Lys Thr
                650                 655                 660

Gly Ser Val Ile Cys Phe Ala Leu Val Leu Ala Val Leu Arg Gln
                665                 670                 675

Gln Asp Lys Glu Ala Arg Thr Lys Glu Ser Arg Ser Ser Pro Ala
                680                 685                 690

Val Glu Gln Gln Leu Leu Val Ser Gly Pro Gly Lys Lys Pro Glu
                695                 700                 705

Asp Ser Arg Val
            709



<210> 2 
<211> 2130 
<212> DNA 
<213> Homo sapiens 
<400> 2 

        10         20         30         40         50         60
atgggaccca ggatagggcc agcgggtgag gtaccccagg taccagacaa ggaaaccaaa 60
gccacaatgg gcacagaaaa cacacctgga ggcaaagcca gcccagaccc tcaggacgtg 120
cggccaagtg tgttccataa catcaagctg ttcgttctgt gccacagcct gctgcagctg 180
gcgcagctca tgatctccgg ctacctaaag agctccatct ccacagtgga gaagcgcttc 240
ggcctctcca gccagacgtc ggggctgctg gcctccttca acgaggtggg gaacacagcc 300
ttgattgtgt ttgtgagcta ttttggcagc cgggtgcacc gaccccgaat gattggctat 360
ggggctatcc ttgtggccct ggcgggcctg ctcatgactc tcccgcactt catctcggag 420
ccataccgct acgacaacac cagccctgag gatatgccac aggacttcaa ggcttccctg 480
tgcctgccca caacctcggc cccagcctcg gccccctcca atggcaactg ctcaagctac 540
acagaaaccc agcatctgag tgtggtgggg atcatgttcg tggcacagac cctgctgggc 600
gtgggcgggg tgcccattca gccctttggc atctcctaca tcgatgactt tgcccacaac 660
agcaactcgc ccctctacct cgggatcctg tttgcagtga ccatgatggg gccaggcctg 720
gcctttgggc tgggcagcct catgctgcgc ctttatgtgg acattaacca gatgccagaa 780
ggtggtatca gcctgaccat aaaggacccc cgatgggtgg gtgcctggtg gctgggtttc 840
ctcatcgctg ccggtgcagt ggccctggct gccatcccct acttcttctt ccccaaggaa 900
atgcccaagg aaaaacgtga gcttcagttt cggcgaaagg tcttagcagt cacagactca 960
cctgccagga agggcaagga ctctccctct aagcagagcc ctggggagtc cacgaagaag 1020
caggatggcc tagtccagat tgcaccaaac ctgactgtga tccagttcat taaagtcttc 1080
cccagggtgc tgctgcagac cctacgccac cccatcttcc tgctggtggt cctgtcccag 1140
gtatgcttgt catccatggc tgcgggcatg gccaccttcc tgcccaagtt cctggagcgc 1200
cagttttcca tcacagcctc ctacgccaac ctgctcatcg gctgcctctc cttcccttcg 1260
gtcatcgtgg gcatcgtggt gggtggcgtc ctggtcaagc ggctccacct gggccctgtg 1320
ggatgcggtg ccctttgcct gctggggatg ctgctgtgcc tcttcttcag cctgccgctc 1380
ttctttatcg gctgctccag ccaccagatt gcgggcatca cacaccagac cagtgcccac 1440
cctgggctgg agctgtctcc aagctgcatg gaggcctgct cctgcccatt ggacggcttt 1500
aaccctgtct gcgaccccag cactcgtgtg gaatacatca caccctgcca cgcaggctgc 1560
tcaagctggg tggtccagga tgctctggac aacagccagg ttttctacac caactgcagc 1620
tgcgtggtgg agggcaaccc cgtgctggca ggatcctgcg actcaacgtg cagccatctg 1680
gtggtgccct tcctgctcct ggtcagcctg ggctcggccc tggcctgtct cacccacaca 1740
ccctccttca tgctcatcct aagaggagtg aagaaagaag acaagacttt ggctgtgggc 1800
atccagttca tgttcctgag gattttggcc tggatgccca gccccgtgat ccacggcagc 1860
gccatcgaca ccacctgtgt gcactgggcc ctgagctgtg ggcgtcgagc tgtctgtcgc 1920
tactacaata atgacctgct ccgaaaccgg ttcatcggcc tccagttctt cttcaaaaca 1980
ggttctgtga tctgcttcgc cttagttttg gctgtcctga ggcagcagga caaagaggca 2040
aggaccaaag agagcagatc cagccctgcc gtagagcagc aattgctagt gtcggggcca 2100
                                                                  2130



< 2 1 0 > 3 

< 2 1 1 > 4083 

< 2 1 2 > DNA 

< 2 1 3 > Homo sapience 



<400> 3 

aagtgaccca gggagacaaa cacttggaga tacttggggc Igagtttgag caagactccc 60 

taacctgtgl ctggacaagt ctgatgtcct gtgtggccca agaagaactg accccgtglc 120 

tggagctccc accgltaltg catccctgct gtggctcacc tgctgctglc tccaggagcc 180 

cctgagaaga lllgcctccl ctcccctgct aagctccagg tcctgagati gaatlagggg 240 

ctggagctca ctgcaclcca gcagtc 266 

atg gga ccc agg ata ggg cca gcg ggt gag gta ccc cag gta cca 311 
Met Gly Pro Arg Ile Gly Pro Ala Gly Glu Val Pro Gln Val Pro 
5 10 15 

gac aag gaa acc aaa gcc aca atg ggc aca gaa aac aca cct gga 356 
Asp Lys Glu Thr Lys Ala Thr Met Gly Thr Glu Asn Thr Pro Gly 
20 25 30 

ggc aaa gcc agc cca gac cct cag gac gtg cgg cca agt gtg ttc 401 
Gly Lys Ala Ser Pro Asp Pro Gln Asp Val Arg Pro Ser Val Phe 
35 40 45 

cat aac atc aag ctg ttc gtt ctg tgc cac agc ctg ctg cag ctg 446 
His Asn Ile Lys Leu Phe Val Leu Cys His Ser Leu Leu Gln Leu 
50 55 60 

gcg cag ctc atg atc tcc ggc tac cta aag agc tcc atc tcc aca 491 
Ala Gln Leu Met Ile Ser Gly Tyr Leu Lys Ser Ser Ile Ser Thr 
65 70 75 

gtg gag aag cgc ttc ggc ctc tcc agc cag acg tcg ggg ctg ctg 536 
Val Glu Lys Arg Phe Gly Leu Ser Ser Gln Thr Ser Gly Leu Leu 
80 85 90 

gcc tcc ttc aac gag gtg ggg aac aca gcc ttg att gtg ttt gtg 581 
Ala Ser Phe Asn Glu Val Gly Asn Thr Ala Leu Ile Val Phe Val 
95 100 105 

agc tat ttt ggc agc cgg gtg cac cga ccc cga atg att ggc tat 626 
Ser Tyr Phe Gly Ser Arg Val His Arg Pro Arg Met Ile Gly Tyr 
110 115 120 

ggg gct atc ctt gtg gcc ctg gcg ggc ctg ctc atg act ctc ccg 671 
Gly Ala Ile Leu Val Ala Leu Ala Gly Leu Leu Met Thr Leu Pro 
125 130 135 

cac ttc atc tcg gag cca tac cgc tac gac aac acc agc cct gag 716 
His Phe Ile Ser Glu Pro Tyr Arg Tyr Asp Asn Thr Ser Pro Glu 
140 145 150 

gct tcc ctg tgc ctg ccc aca acc 761 
Asp Met Pro Gln Asp Phe Lys Ala Ser Leu Cys Leu Pro Thr Thr 
155 160 165 

tcg gcc cca gcc tcg gcc ccc tcc aat ggc aac tgc tca agc tac 806 
Ser Ala Pro Ala Ser Ala Pro Ser Asn Gly Asn Cys Ser Ser Tyr 
170 175 180 

aca gaa acc cag cat ctg agt gtg gtg ggg atc atg ttc gtg gca 851 
Thr Glu Thr Gln His Leu Ser Val Val Gly Ile Met Phe Val Ala 
185 190 195 

cag acc ctg ctg ggc gtg ggc ggg gtg ccc att cag ccc ttt ggc 896 
Gln Thr Leu Leu Gly Val Gly Gly Val Pro Ile Gln Pro Phe Gly 
200 205 210 

atc tcc tac atc gat gac ttt gcc cac aac agc aac tcg ccc ctc 941 
Ile Ser Tyr Ile Asp Asp Phe Ala His Asn Ser Asn Ser Pro Leu 
215 220 225 

tac ctc ggg atc ctg ttt gca gtg acc atg atg ggg cca ggc ctg 986 
Tyr Leu Gly Ile Leu Phe Ala Val Thr Met Met Gly Pro Gly Leu 
230 235 240 

gcc ttt ggg ctg ggc agc ctc atg ctg cgc ctt tat gtg gac att 1031 
Ala Phe Gly Leu Gly Ser Leu Met Leu Arg Leu Tyr Val Asp Ile 
245 250 255 

aac cag atg cca gaa ggt ggt atc agc ctg acc ata aag gac ccc 1076 
Asn Gln Met Pro Glu Gly Gly Ile Ser Leu Thr Ile Lys Asp Pro 
260 265 270 

cga tgg gtg ggt gcc tgg tgg ctg ggt ttc ctc atc gct gcc ggt 1121 
Arg Trp Val Gly Ala Trp Trp Leu Gly Phe Leu Ile Ala Ala Gly 
275 280 285 

gca gtg gcc ctg gct gcc atc ccc tac ttc ttc ttc ccc aag gaa 1166 
Ala Val Ala Leu Ala Ala Ile Pro Tyr Phe Phe Phe Pro Lys Glu 
290 295 300 

atg ccc aag gaa aaa cgt gag ctt cag ttt cgg cga aag gtc tta 1211 
Met Pro Lys Glu Lys Arg Glu Leu Gln Phe Arg Arg Lys Val Leu 
305 310 315 

gca gtc aca gac tca cct gcc agg aag ggc aag gac tct ccc tct 1256 
Ala Val Thr Asp Ser Pro Ala Arg Lys Gly Lys Asp Ser Pro Ser 
320 325 330 

aag cag agc cct ggg gag tcc acg aag aag cag gat ggc cta gtc 1301 
Lys Gln Ser Pro Gly Glu Ser Thr Lys Lys Gln Asp Gly Leu Val 
335 340 345 

cag att gca cca aac ctg act gtg atc cag ttc att aaa gtc ttc 1346 
Gln Ile Ala Pro Asn Leu Thr Val Ile Gln Phe Ile Lys Val Phe 
350 355 360 

ccc agg gtg ctg ctg cag acc cta cgc cac ccc atc ttc ctg ctg 1391 
Pro Arg Val Leu Leu Gln Thr Leu Arg His Pro Ile Phe Leu Leu 
365 370 375 

gtg gtc ctg tcc cag gta tgc ttg tca tcc atg gct gcg ggc atg 1436 
Val Val Leu Ser Gln Val Cys Leu Ser Ser Met Ala Ala Gly Met 
380 385 390 

gcc acc ttc ctg ccc aag ttc ctg gag cgc cag ttt tcc atc aca 1481 
Ala Thr Phe Leu Pro Lys Phe Leu Glu Arg Gln Phe Ser Ile Thr 
395 400 405 



gcc tcc tac gcc aac ctg ctc atc ggc tgc ctc tcc ttc cct tcg 1526 
Ala Ser Tyr Ala Asn Leu Leu Ile Gly Cys Leu Ser Phe Pro Ser 
410 415 420 



gtc atc gtg ggc atc gtg gtg ggt ggc gtc ctg gtc aag cgg ctc 1571 
Val Ile Val Gly Ile Val Val Gly Gly Val Leu Val Lys Arg Leu 
425 430 435 

cac ctg ggc cct gtg gga tgc ggt gcc ctt tgc ctg ctg ggg atg 1616 
His Leu Gly Pro Val Gly Cys Gly Ala Leu Cys Leu Leu Gly Met 
440 445 450 

ctg ctg tgc ctc ttc ttc agc ctg ccg ctc ttc ttt atc ggc tgc 1661 
Leu Leu Cys Leu Phe Phe Ser Leu Pro Leu Phe Phe Ile Gly Cys 
455 460 465 



tcc agc cac cag att gcg ggc atc aca cac cag acc agt gcc cac 1706 
Ser Ser His Gln Ile Ala Gly Ile Thr His Gln Thr Ser Ala His 
470 475 480 

cct ggg ctg gag ctg tct cca agc tgc atg gag gcc tgc tcc tgc 1751 
Pro Gly Leu Glu Leu Ser Pro Ser Cys Met Glu Ala Cys Ser Cys 
485 490 495 

cca ttg gac ggc ttt aac cct gtc tgc gac ccc agc act cgt gtg 1796 
Pro Leu Asp Gly Phe Asn Pro Val Cys Asp Pro Ser Thr Arg Val 
500 505 510 

gaa tac atc aca ccc tgc cac gca ggc tgc tca agc tgg gtg gtc 1841 
Glu Tyr Ile Thr Pro Cys His Ala Gly Cys Ser Ser Trp Val Val 
515 520 525 

cag gat gct ctg gac aac agc cag gtt ttc tac acc aac tgc agc 1886 
Gln Asp Ala Leu Asp Asn Ser Gln Val Phe Tyr Thr Asn Cys Ser 
530 535 540 

tgc gtg gtg gag ggc aac ccc gtg ctg gca gga tcc tgc gac tca 1931 
Cys Val Val Glu Gly Asn Pro Val Leu Ala Gly Ser Cys Asp Ser 
545 550 555 

acg tgc agc cat ctg gtg gtg ccc ttc ctg ctc ctg gtc agc ctg 1976 
Thr Cys Ser His Leu Val Val Pro Phe Leu Leu Leu Val Ser Leu 
560 565 570 

ggc tcg gcc ctg gcc tgt ctc acc cac aca ccc tcc ttc atg ctc 2021 
Gly Ser Ala Leu Ala Cys Leu Thr His Thr Pro Ser Phe Met Leu 
575 580 585 

atc cta aga gga gtg aag aaa gaa gac aag act ttg gct gtg ggc 2066 
Ile Leu Arg Gly Val Lys Lys Glu Asp Lys Thr Leu Ala Val Gly 
590 595 600 

atc cag ttc atg ttc ctg agg att ttg gcc tgg atg ccc agc ccc 2111 
Ile Gln Phe Met Phe Leu Arg Ile Leu Ala Trp Met Pro Ser Pro 
605 610 615 

gtg atc cac ggc agc gcc atc gac acc acc tgt gtg cac tgg gcc 2156 
Val Ile His Gly Ser Ala Ile Asp Thr Thr Cys Val His Trp Ala 
620 625 630 

ctg agc tgt ggg cgt cga gct gtc tgt cgc tac tac aat aat gac 2201 
Leu Ser Cys Gly Arg Arg Ala Val Cys Arg Tyr Tyr Asn Asn Asp 
635 640 645 

ctg ctc cga aac cgg ttc atc ggc ctc cag ttc ttc ttc aaa aca 2246 
Leu Leu Arg Asn Arg Phe Ile Gly Leu Gln Phe Phe Phe Lys Thr 
650 655 660 

ggt tct gtg atc tgc ttc gcc tta gtt ttg gct gtc ctg agg cag 2291 
Gly Ser Val Ile Cys Phe Ala Leu Val Leu Ala Val Leu Arg Gln 
665 670 675 

cag gac aaa gag gca agg act aaa gag agc aga tcc agc cct gcc 2336 
Gln Asp Lys Glu Ala Arg Thr Lys Glu Ser Arg Ser Ser Pro Ala 
680 685 690 

gta gag cag caa ttg cta gtg tcg ggg cca ggg aag aag cca gag 2381 
Val Glu Gln Gln Leu Leu Val Ser Gly Pro Gly Lys Lys Pro Glu 
695 700 705 

gat tcc cga gtg tga 2396 
Asp Ser Arg Val 
709 

gctgtcttgg ggccccacct ggccaagagt agcagccaca gcagtacctc ctctgagtcc 2456 
tttgcccaag a.tgggtgtc aagagccctg tg.tccattc tggctcctcc actaaattgc 2516 
tgtgtgacU caggcaagac attgatcctc .c.cagcc.t tgcttgctag tctgaaccaa 2576 
agagttgttt gggcatttgc tgtgttggcc att.ctggag caagagggtc ttcttcctcc 2636 
ttcccccagc cagccagctg tcctggggcc aggctttcct gggtggaaag aagtatacct 2696 
ttccctgggg ccctaggata gcaaagtgag ccatagtggg ccaggctgcc ctccatgctg 2756 
ggccccagcc caggtc.gca ctcgcctgga tcaccttct. tgagcct.ag ccatc.cctg 2816 
tcaggtagga atgaact.gc cagccttcag gctcgttcag ctatgacca. ctgtgcggtc 2876 
agggtacact cagctctcct ccccaactcc agcagccttt aagaagtgtc cctttggcgc 2936 
cccctggagg cagagcactg agctggaccc tgggtagact cccacaggga ggacggagct 2996 
ggcctcagga gtgggacacc cagacttggc agggcr.ttca agaggcctgt gtgggggccc 3056 
caggaatcct tagctgaagc ggggagac.c actctccatc tcaggaaatt ctagccc.tg 3116 
ccctcaggga gccacggt.g agggtgaggc ccaacacc.g ccttagggcc ctgggtgggc 3176 
aagtctgggc cctggggtag ggagggagac tcaggcccac acttgggtat tttctaattt 3236 
cagacaaaca cacactcagc gcgcactcac tgattcctac acattgccaa gatttcacac 3296 
atgtgaccag gggccaccaa agtccctg.g acctttgtga ctaggatcct aatttctc.a 3356 
ttttctcctg ggtgcctggg tc.gtg.cac ctggggcag. gtggataatg tttagttctg 3416 
tgacactgtt M.tggggg. ggcacc.ggt tc.ccga.gc ctgggctggt gtcaggccca 3476 
ggactgtagt gctgggagca gtaaagctca gctctgtgta atgagtgatg ctatggcttg 3536 
ctcgtgtctt atgatccaat ccttttctac atcagccctt gttttgtttt atggctagtc 3596 
ttatctggcc tggttatttc cttgcgggga ggagagggtt tgctaatctg ctcccagccc 3656 
aacctattac caccccacct cgctgggacc tactgctcgg gaggcagcag acagggagcc 3716 
accagcagtg gcttcctggc cctgtgctgg gggtgggggg aagctggggg cacatgtggc 3776 
ccttgccttc tgagcagctc ccagtgccag ggctttgaga ctttcccaca tgataaaaga 3836 
aaagggaggt acagaagttc caattccctt tttattttgc tggttggtat ctgtaaatgt 3896 
ttaataaata tctgagcatg tatctatcaa cgccaagaat ttcaaagtct ccttcaacaa 3956 
tatgaggctt ttaggatgtt tatattcctt catccctctt gtttcccagg ttttgcaggg 4016 
aaaaaaagtc tggaattata gatacagctt attattaaat ttgttcttgc ataaaaaaaa 4076 
aaaaaaa 4083 